On the automatic translation of noun compounds: challenges and strategies
نویسنده
چکیده
Several aspects of compounds make them particularly difficult to handle in a system performing automatic translation, being it a machine translation system or a system for cross-language information retrieval. First, the relation between the parts of a compound is implicit and thus its interpretation is never wholly compositional. Whereas the interpretation of clauses is guided by syntactic clues, such as word order and morphemic markers, the meaning of a compound cannot be fully recovered from the surface structure. This is, in particular, a problem when translating from a language with frequent use of compounds, to a language generally preferring syntactic constructions instead, since an overt syntactic marker (usually a preposition) is then to be generated. Secondly, also languages with frequent use of compounds differ as to when the use of a compound is preferred over some other construction type that is, it seems in part to be an arbitrary decision of each language when to compound.
منابع مشابه
Equivalence in Technical Texts: The Case of Accounting Terms in English-Persian Dictionaries
Translating accounting documents, in general, and accounting terminology, in particular, is not a simple task, especially when the new terms keep created in pace with accounting developments. This study was carried out to find the most common and preferable ways to translate accounting terms from English into Persian. Also, an attempt was made to identify the frequently used patterns of word-fo...
متن کاملEquivalence in Technical Texts: The Case of Accounting Terms in English-Persian Dictionaries
Translating accounting documents, in general, and accounting terminology, in particular, is not a simple task, especially when the new terms keep created in pace with accounting developments. This study was carried out to find the most common and preferable ways to translate accounting terms from English into Persian. Also, an attempt was made to identify the frequently used patterns of word-fo...
متن کاملAutomatic acquisition of “noun+verb” idiomatic compounds in Korean*14
Song, Sanghoun. 2015. Automatic acquisition of “noun+verb” idiomatic compounds in Korean. Linguistic Research 32(1), 253-280. The state-of-the-art skills of computational linguistics pay attention to lexical semantics, because it has a potential to be used to improve language processing systems in terms of coverage as well as accuracy. In particular, utilizing multiword expressions is important...
متن کاملThe Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
متن کاملOn the semantics of noun compounds
The noun compound – a sequence of nouns which function as a single noun – is very common in English texts. No language processing system should ignore expressions like steel soup pot cover if it wants to be serious about such high-end applications of computational linguistics as question answering, information extraction, text summarization, machine translation – the list goes on. Processing no...
متن کامل